Accelerating Lattice QCD Multigrid on GPUs Using Fine-Grained Parallelization

Abstract

The past decade has witnessed a dramatic acceleration of lattice quantum chromodynamics calculations in nuclear and particle physics. This has been due to both significant progress in accelerating the iterative linear solvers using multi-grid algorithms, and due to the throughput improvements brought by GPUs. Deploying hierarchical algorithms optimally on GPUs is non-trivial owing to the lack of parallelism on the coarse grids, and as such, these advances have not proved multiplicative. Using the QUDA library, we demonstrate that by exposing all sources of parallelism that the underlying stencil problem possesses, and through appropriate mapping of this parallelism to the GPU architecture, we can achieve high efficiency even for the coarsest of grids. Results are presented for the Wilson-Clover discretization, where we demonstrate up to 10x speedup over present state-of-the-art GPU-accelerated methods on Titan. Finally, we look to the future, and consider the software implications of our findings.
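The idea of exposing more parallelism than the number of coarse-grid sites can be illustrated with a toy example. The following is a minimal sketch only, not QUDA code and not the authors' implementation: it assumes a hypothetical 1D periodic coarse grid with an on-site block plus forward and backward neighbour terms, and it assumes that the extra sources of parallelism are the output row and the stencil term (the abstract does not enumerate them). All function names are invented for illustration. Each loop iteration stands in for one independent GPU thread.

```python
import random

def make_problem(num_sites, n, seed=0):
    """Build random integer link matrices and a source vector (toy data)."""
    rng = random.Random(seed)

    def mat():
        return [[rng.randint(-3, 3) for _ in range(n)] for _ in range(n)]

    # links[x][t]: t = 0 is the on-site block, t = 1 couples to x+1, t = 2 to x-1
    links = [[mat() for _ in range(3)] for _ in range(num_sites)]
    vec = [[rng.randint(-3, 3) for _ in range(n)] for _ in range(num_sites)]
    return links, vec

def neighbor(x, term, num_sites):
    """Periodic neighbour index for stencil term 0 (self), 1 (fwd), 2 (bwd)."""
    return (x + (0, 1, -1)[term]) % num_sites

def apply_site_parallel(links, vec):
    """One work item per site: only num_sites independent tasks."""
    num_sites, n = len(vec), len(vec[0])
    out = []
    for x in range(num_sites):  # each iteration models one thread
        rows = []
        for r in range(n):
            acc = 0
            for t in range(3):
                src = vec[neighbor(x, t, num_sites)]
                acc += sum(links[x][t][r][c] * src[c] for c in range(n))
            rows.append(acc)
        out.append(rows)
    return out

def apply_fine_grained(links, vec):
    """One work item per (site, row, stencil term), followed by a reduction."""
    num_sites, n = len(vec), len(vec[0])
    work_items = [(x, r, t)
                  for x in range(num_sites)
                  for r in range(n)
                  for t in range(3)]
    partial = {}
    for x, r, t in work_items:  # each iteration models one thread
        src = vec[neighbor(x, t, num_sites)]
        partial[(x, r, t)] = sum(links[x][t][r][c] * src[c] for c in range(n))
    # Reduce the three stencil-term contributions for every output row.
    out = [[sum(partial[(x, r, t)] for t in range(3)) for r in range(n)]
           for x in range(num_sites)]
    return out, len(work_items)

if __name__ == "__main__":
    links, vec = make_problem(num_sites=2, n=4)
    coarse = apply_site_parallel(links, vec)
    fine, n_items = apply_fine_grained(links, vec)
    print("results agree:", fine == coarse)
    print("site-parallel tasks:", len(vec), "fine-grained tasks:", n_items)
```

On a grid with only two sites, the site-parallel version has two independent tasks, while the fine-grained version has sites × rows × stencil terms. Integer data is used so that the two summation orders give identical results; with floating-point data the results would agree only up to rounding.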

Bibliographic details

Authors: Clark, M. A.; Joó, Bálint; Strelchenko, Alexei; Cheng, Michael; Gambhir, Arjun; Brower, Richard
Year: 2016
Original format: PDF

Similar literature

1. Fine-grained Parallelization of Lattice QCD Kernel Routine on GPUs [J]. Khaled Z. Ibrahim, Francois Bodin, Olivier Pene. Journal of Parallel and Distributed Computing, 2008, No. 10.
2. Accelerating lattice QCD simulations with 2 flavors of staggered fermions on multiple GPUs using OpenACC - A first attempt [J]. Gupta Sourendu, Majumdar Pushan. Computer Physics Communications, 2018.
3. Lattice QCD on parallel computers - Proceedings of the International Workshop on Lattice QCD on Parallel Computers - Tsukuba, Ibaraki, Japan - 10-15 March 1997 - Preface [J]. Iwasaki Y., Ukawa A. Nuclear Physics B, 1998, No. S60A.
4. Accelerating Lattice QCD Multigrid on GPUs Using Fine-Grained Parallelization [C]. M. A. Clark, Bálint Joó, Alexei Strelchenko. International Conference for High Performance Computing, Networking, Storage and Analysis, 2016.
5. GPU accelerated study of heat transfer and fluid flow by lattice Boltzmann method on CUDA [D]. Ren, Qinlong. 2016.
6. A regression algorithm for accelerated lattice QCD that exploits sparse inference on the D-Wave quantum annealer [O]. Nga T. T. Nguyen, Garrett T. Kenyon, Boram Yoon.
7. Accelerating lattice QCD simulations with 2 flavours of staggered fermions on multiple GPUs using OpenACC - a first attempt [O]. Gupta, Sourendu; Majumdar, Pushan. 2017.
